CDS
Accession Number | TCMCG075C04893 |
gbkey | CDS |
Protein Id | XP_017971996.1 |
Location | complement(join(1765394..1765498,1765587..1765865,1765999..1766274,1766430..1766648,1766828..1766996,1767267..1767441,1767519..1767630,1767992..1768243)) |
Gene | LOC18607298 |
GeneID | 18607298 |
Organism | Theobroma cacao |
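The Location field above uses GenBank feature-location syntax: `join(...)` lists the exon intervals and `complement(...)` places the feature on the reverse strand. The following is a minimal sketch, assuming 1-based inclusive coordinates (the GenBank convention), that sums the interval lengths and derives the expected protein length. The helper name `parse_intervals` is hypothetical and not part of this record.

```python
import re

# Location string copied from the record.
LOCATION = (
    "complement(join(1765394..1765498,1765587..1765865,1765999..1766274,"
    "1766430..1766648,1766828..1766996,1767267..1767441,1767519..1767630,"
    "1767992..1768243))"
)

def parse_intervals(location):
    """Return (start, end) integer pairs from a GenBank-style location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

intervals = parse_intervals(LOCATION)
# Coordinates are assumed 1-based and inclusive, so each length is end - start + 1.
cds_length = sum(end - start + 1 for start, end in intervals)
is_reverse_strand = LOCATION.startswith("complement(")
# Number of codons minus the terminal stop codon.
protein_length = cds_length // 3 - 1

print(len(intervals), cds_length, protein_length, is_reverse_strand)
# prints: 8 1587 528 True
```

Under these assumptions the eight intervals total 1587 nt, that is 529 codons including the stop, which agrees with the 528 aa length given in the Protein block.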
Protein
Length | 528aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018116507.1 |
Definition | PREDICTED: squalene monooxygenase [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | I |
Description | squalene |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | R02874 |
KEGG_rclass | RC00201 |
BRITE | ko00000, ko00001, ko01000 |
KEGG_ko | ko:K00511 |
EC | 1.14.14.17 |
KEGG_Pathway | ko00100, ko00909, ko01100, ko01110, ko01130, map00100, map00909, map01100, map01110, map01130 |
GOs | - |
Sequence
CDS: ATGGGATACGAGTGTATAGTTGAAGGGCTGGTAGCCGCTCTGCTGGGGTTTGTTTTCTTGTACAACGCTTTCGTAAGAGGATTCAACAAGACAAAAGCTGCAGCAGGTGCTGCTTCTTCTTCTTCTTCCATGGTGTCTCCAATGGAAAATTGTGTGAGGAAAACAGGGAATGGCGAGGTTGCTGGGAGTACAGACATTATCATAGTTGGCGCTGGAGTTGCTGGTTCTGCTCTTGCTTATACATTTGGAAAGGACGGACGCCGAGTGCATGTGATAGAGAGAGACTTAAGTGAGCCTGACAGAATTGTTGGTGAACTTCTACAACCAGGGGGCTACCTTAAGTTAATTGAGTTGGGTCTTGAAGATTGTGTAGATGACATTGATGCTCAACAGGTTTTTGGCTATGCTCTGTACAAGGATGGAAAGAATACCAGGTTGTCTTATCCCCTGGAAAAGTTTCACTCTGATGTTGCTGGAAGAAGCTTCCACAATGGACGTTTCATACAAAGGATGCGGCAGAAAGCTGCTTCTCTTCCCAATGTAACTCTAGAACAAGGAACAGTAACATCTCTGCTTGAAGAAAATGCGACTATCAAGGGAGTTCAGTACAAAACTAAGGGTGGTCAAGAGTTGACAGCATATGCTCCCCTTACTATTGTATGCGATGGTTGTTTCTCAAATTTGAGACGCTCTCTCTGTGACCCGAAGGTTGAGGTCCCCTCTTGTTTTGTTGGATTGGTTCTGGAGAACTGTGAGCTTCCGCATGCAAACTATGGACATGTTATATTGGCAGACCCTTCACCTATCTTGTTTTACCCTATCAGCAGCACCGAGATTCGTTGCTTGGTTGATGTGCCTGGCCAAAAAGTTCCTTCTGTTTCCAATGGTGAAATGGCCCAGTACTTGAAAACTGTGGTGGCTCCCCAGATTCCTTCTGAACTGCACACTGCCTTTATATCCGCAATTGATAAGGGCAACATAAGAACCATGCCAAACAGAAGCATGCCTGCTGCTCCACACTCAACTCCTGGTGCACTTTTAATGGGTGATGCATTCAATATGAGACATCCTTTAACCGGAGGGGGAATGACTGTTGCACTATCTGATATTGTGGTACTAAGGGATCTTCTAAGACCCCTGTACGATCTGTATGATGCATCTACTCTTTGCAAATACCTTGAATCTTTTTATACCTTGCGGAAGCCAGTGGCATCTACAATAAACACATTGGCTGGCGCCCTATACAAGGTATTCAGTGCCTCCCCTGATCCAGCAAGGAAGGAGATGCGGCAAGCATGCTTTGACTACTTGAGCCTTGGAGGCGTATTCTCAAATGGACCAATCTCTCTGCTCTCCGGTTTGAACCCCCGCCCCATAAGCTTAGTCCTACATTTTTTCGCGGTGGCTGTCTATGGCGTTGGCCGCTTGTTACTTCCATTTCCTTCACCCAAACGCATTTGGACTGGGGCTAGATTGATTTCGGGTGCATCAGGCATCATTTTCCCCATTATCAAGGCTGAAGGGGTTAGACAAATGTTTTTCCCTGCAACTGTGCCAGCATACTACAGAGCTCCTCCTGTTCATTGA |
Protein: MGYECIVEGLVAALLGFVFLYNAFVRGFNKTKAAAGAASSSSSMVSPMENCVRKTGNGEVAGSTDIIIVGAGVAGSALAYTFGKDGRRVHVIERDLSEPDRIVGELLQPGGYLKLIELGLEDCVDDIDAQQVFGYALYKDGKNTRLSYPLEKFHSDVAGRSFHNGRFIQRMRQKAASLPNVTLEQGTVTSLLEENATIKGVQYKTKGGQELTAYAPLTIVCDGCFSNLRRSLCDPKVEVPSCFVGLVLENCELPHANYGHVILADPSPILFYPISSTEIRCLVDVPGQKVPSVSNGEMAQYLKTVVAPQIPSELHTAFISAIDKGNIRTMPNRSMPAAPHSTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLRPLYDLYDASTLCKYLESFYTLRKPVASTINTLAGALYKVFSASPDPARKEMRQACFDYLSLGGVFSNGPISLLSGLNPRPISLVLHFFAVAVYGVGRLLLPFPSPKRIWTGARLISGASGIIFPIIKAEGVRQMFFPATVPAYYRAPPVH |
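The Protein line is the conceptual translation of the CDS line. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The helper `translate` is hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# In-frame excerpts copied from the start and the end of the CDS line.
cds_start = "ATGGGATACGAGTGTATAGTTGAAGGG"
cds_end = "GCTCCTCCTGTTCATTGA"

print(translate(cds_start))  # prints: MGYECIVEG
print(translate(cds_end))    # prints: APPVH
```

Both results match the corresponding ends of the Protein line (`MGYECIVEG...` and `...APPVH`), and the final `TGA` codon is the stop that is not represented in the protein sequence.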